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ABSTRACT 

This paper introduces new phase-space models of dwarf spheroidal galaxies (dSphs) . 
The stellar component has an isotropic, lowered isothermal (or King) distribution 
function. A physical basis for the isotropization of stellar velocities is given by the 
theory of tidal stirring, whilst the isothermality of the distribution function guarantees 
the observed flatness of the velocity dispersion profile in the inner parts. For any 
analytic dark matter potential - whether of cusped or of cored form - the stellar 
density and velocity dispersion are analytic. 

The origin of the observational correlation between half-light radius and line 
of sight central velocity dispersion a Pj o is investigated. We prove that a power-law 
correlation i?h oc a® can exist if, and only if, the dark halo potential is a power-law of 
the radius. Although a power-law is a good approximation in the central parts (D = 2 
for a Navarro- Frenk- White halo, D = 1 for cored halos), the theoretical correlation 
curve between i?h and ct Pi o dramatically steepens at larger half-light radii. Using our 
phase space models, we show that different dark halo profiles - whether cored or 
cusped - lead to very similar mass estimates within one particular radius, ~ 1.7i£h- 
The formula for the enclosed mass is M(< 1.7i?h) is ~ 5.8tJp Rh/G and extends out 
to much larger radii than previous investigations. This is a tight result for models 
with a flatfish projected velocity dispersion profile (out to several half-light radii). 
We show that deviations between mass measures due to different density profiles are 
substantially smaller than the uncertainties propagated by the observational errors on 
the half-light radius and central velocity dispersion. We produce a mass measure for 
each of the dSphs and find that the two most massive of the Milky Way dSphs are the 
most luminous, namely Sgr (M(< 1.7i? h ) ~ 2.8 xlO 8 M ) and Fornax (~ 1.3xfO 8 M ). 
The least massive of the Milky Way satellites are Willman 1 (~ 4 x 10 5 M Q ) and Segue 
1(~6X 1O 5 M ). 
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1 INTRODUCTION 

The dwarf spheroidal (dSph) galaxies surrounding the Milky 
Way have been the subject of intensive observational pro- 
grams in recent yea rs. Thanks to the l abours of a number 
of groups (see e.g.. iMateo et al.lll993l: iKlevna et alj |2002| . 
120041 : IWilkinson et al.ll2004l : IWalker et al.ll2009cl h radial ve- 
locity surveys with multi-object spectrographs have now 
provided datasets of thousands of stars for the bright dSphs, 
like Fornax and Draco. Early ind ications that the ve locity 
dispersion profiles might be flat (|Klevna et al.| [2001) have 
now been confirmed for all the bright dSphs out to the ra- 
dius at which the mean sur face brightness falls to the back- 
ground l|Walker et al.ll2007l '). 

The huge interest in the dSphs is of course provoked by 
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the enormous mass-to-light ratios inferred from their stel- 
lar kin ematics. Ever since the pioneering work of lAaronsonl 
(1983), it has been apparent that the dSphs are the most 
dark matter dominated systems in the Universe. From a 
theoretical perspective, the substantial dark matter content 
makes the dSphs relatively simple. They are typically com- 
posed of intermediate-age to old stellar populations appar- 
ently embedded in massive dark halos. In the bright dSphs, 
star formation therefore ceased many dynamical times ago, 
and so the stellar content should be well-mixed in the po- 
tential of the dark halo. This therefore offers one of the best 
opportunities to learn about the structure of dark halos in 
the local universe. 

Since 2005, there has been a succession of discov- 
eries of very faint Local Group dwarfs in data from 
the Sloan Digital Sky Survey. This includes at least ten 
new dwarf spheroidals (jWillman et al.l l2005al : IZucker et al.l 
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three satellites that have properties intermediate between 
dwarf galaxies and globular clusters l|Willman et al.l [2005b; 
iBelokurov et all l2007t 120091 ). All these objects have sur- 
face brightnesses and luminosities lower than any previously 
known galaxies, and consequently have come to be known 
collectively as the ultrafaints. It is often assumed, although 
without much evidence, that the ultrafaints are the low lu- 
minosity counterparts of the classical dwarfs. 

It is fair to say that the modelling of dSphs has not kept 
pace with the march of the observational data. A number 
of authors have inferred properties of the dSphs based on 
the Jeans equations. Typically, this takes the form of as- 
suming a parametric light profile for the stellar component 
and inferring the velocity dispersion from the Jeans equa- 
tions given an assumed law for the dark m atter halo, of- 
ten o f cusped or Navarro-Fre n k- White form llStrigari et al.l 
20081: iPenarrubia et all |200S| ; IWalker et all l2009d ). This is 
a legitimate procedure, although it does have substantial 
drawbacks. First, there is no guarantee that a physical dis- 
tribution function exists for the model. For example, it is 
not possible to embed an isotropic cored stellar profile in a 
Navarro-Frenk- White halo, even though the Jeans equations 
yield a solution - as the configu ration falls foul of th e Central 
Velocity Dispersion Theorem l|An fc Evansl (|2009l ); see also 
Ciotti & Pellegrini (1992) for an earlier related result). Sec- 
ond, the luminous and dark matter profiles are both posited 
a priori and it is therefore unlikely that this approach will 
lead to any insight beyond the starting assumptions. There 
is no physical connection between the luminous and dark 
matter, other than the fact that the velocity dispersions can 
support the model against gravitational collapse. 

Here, we shall take a different approach based on phase 
space modelling. This is harder than Jeans modelling and 
has been pursued less often, but is also more powerful. 
A significant previ o us as sault on the problem was made 
by I Wilkinson et all (2002), although with a restricted class 
of models in which the baryonic and dark matter have the 
same characteristic scalelength. The flatness of the veloc- 
ity dispersio n profiles of d S phs, a s is evident in the impres- 
sive data of IWalker et al.l (|2007l ). suggests that an obvious 
starting point for the dSph stars is an is otropic, isothermal 
distribution function l|Evans et all [20091. Better still is to 
use their spatially limited analogues, the lowered or quasi- 
isothermals, which have tidal radii imposed by the Milky 
Way potential. These distribution functions a re familiar in 
the modell ing of self-gravitating star clusters (jMichie 1963; 
iKind fl966l ) . Let us again emphasise that this approach is 
tailored to give models that have flatfish projected velocity 
dispersion profiles out to several half-light radii. The stellar 
component of the dSphs is of course not self-gravitating, so 
we will develop the theory of lowered or quasi-isothermal dis- 
tribution functions embedded in dark matter haloes. Notice 
that this gives a physically motivated starting point which 
ensures the flatness of the velocity dispersion profiles in the 
inner parts of the dSphs. There is also a physical connec- 
tion between the light profile and the dark matter, as the 
relaxation of stars towards a quasi-isothermal distribution 
function takes place in the dark matter potential. The same 
distribution in energy space gives rise to different light pro- 
files in different dark matter potentials. 



Since detailed photometric and kinematic profiles are 
available only for a few of the dSphs, we do not model 
each galaxy individually. Rather, we take advantage of the 
fact that half-light radii and central velocity dispersions are 
instead available for at least 28 dSphs, orbiting either the 
Milky Way or the Andromeda galaxies. Observational data 
show a clear correlation between these two physical quan- 
tities, highlighting a connection between the length scale 
of these systems, the ir kinematic propert ies and hence their 
dark matter content |Walker e t al. 2009cj). Although the ex- 
istence of the correlation is clear-cut, its precise form and 
origin is open to some dispute. 

We begin in §2 by re- vis iting the dat a on ha lf-light 
radii and velocity dispersion. IWalker et al.l (|2009d ) inter- 
preted the data as a power-law correlation, and analysed 
the consequences in terms of a universal halo profile for all 
the dSphs. We show that, in such a context, a strict power- 
law correlation necessarily implies that the universal gravita- 
tional potential is a power-law of the radius. In other words, 
for realistic halo models, the correlation always shows de- 
viations from the power-law form at large half-light radii. 
These residuals contain important physical information on 
the form of the dark halo potential. 

The next two sections sketch the theoretical framework 
to analyse the correlation. In §3, we derive the behaviour of 
the correlation in the central parts of cusped and cored uni- 
versal halos, by assuming an isothermal distribution func- 
tion for the luminous stellar components. Although these 
asymptotic results hold good if the luminous material is em- 
bedded deep within the dark halo, they eventually became 
unreliable. Accordingly, §4 develops the theory of quasi- 
isothermal distribution functions in dark halo potentials, 
which enable us to extend the correlation into the regime 
where the power-law breaks down. This gives us families of 
distribution functions that build dSphs, for which the veloc- 
ity dispersion profile is flat out to several multiples of the 
half-light radii and for which the correlation between veloc- 
ity dispersion and half-light radius is essentially the same, 
modulo scaling transformations. 

We return to the hypothesis of universality, and fit a 
single dark halo model to the data on half-light radius and 
velocity dispersion in §5. It is perhaps better though to per- 
form an object by object analysis. This gives predictions 
on the mass of the dark halo for each dSph in §6. We show 
that our modelling gives robust mass estimates out to about 
1.7 times the half-light radius. This strong result is a direct 
consequence of using distribution function modelling for the 
luminous component instead of a parametric density profile. 
This eliminates, by construction, all non-physical solutions 
that a Jeans analysis cannot exclude. 



2 THE CORRELATION 
2.1 Walker's Ansatz 



IWalker et all (|2009cf) looked for a correlation between the 
half-light radius and central velocity dispersion of the form: 



Rh 
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C 
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(1) 
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or, equivalently, 
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D log 



Vp,0 

kms~ ] 



+ logC 



(2) 



where the coefficient C and exponent D are chosen to give 
the best fit to the data on the dSphs. Here, Rh is the pro- 
jected half-light radius, that is, the radius of the projected 
cylinder which contains a half of the total luminosity of the 
system, whilst <t Pi o = o" p (0) is the projected or line of sight 
velocity dispersion at the centre. 

We will re- visit the fitting shortly, but it is worth explor- 
ing at outset the consequences of a strict power-law correla- 
tion like eqn ((TJ). It has been claimed that the dSphs could 
actually be characterized by some kind of physical universal- 
ity, for example concerning a common mass scale (see e.j 
iMateo et af1ll993l : iGilmore et ai1l2007l : IStrigari et ai1l200S 
The hypothesis of the uniformity of the properties of the 
dark matter halos embedding the local dSphs constitutes an 
obvious first step in trying to grasp the physical meaning 
of any correlations. Let $(r) be the spherically symmetric 
gravitational potential characterizing all the dSphs, and let 
p*(r) be the stellar density distribution of any dSph (they 
can be different). Given the overwhelming preponderance of 
dark matter in dSphs, the potential well $ is supposed to 
be generated by the dark matter halo only. The luminous 
components are test or tracer particles only. 

Each luminous component may be described by its dis- 
tribution function /*, which we assume to have the same 
functional form for all the dSphs. There is no evidence 
for anisotropic velocity dispersions, so we make the sim- 
plest possible assumption of isotropy. The distribution func- 
tion /„ must be a function of the energy only, namely 
/, = f t {E/a 2 ), where a is a constant. As a consequence, the 
luminous density distribution is a function of radius through 
the gravitational potential only: 

"$(r)" 



p*(r) 



(3) 



Here, we do not need to fix the properties of the distribution 
function /* , and thus of the density p* , in any greater de- 
tail. We just require that the velocity dispersion parameter 
a be closely related to the actual physical (central) veloc- 
ity dispersion (f 2 )», as naturally happens on dimensional 
grounds. 

The equation which defines the half-light radius _Rh is 

/•-Rh 

M*(R h ) = 4tt 



rdr 



RdR I p»(c£>(r)/a 2 )^_= 
I r v?" — R 



= 2-k 
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rdr 



R 2 
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A power-law relation Rh_(o~) oc a D is equivalent to the in- 
variance of eqn Q with respect to the transformation 



/3 a, 



p r 



(5) 



for all positive real constants /3. This is an invariant trans- 
formation if and only i£] 



(6) 



1 We are neglecting here the shallower dependences upon any 
other free parameter, as for example the tidal radius. 
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For reasonable, non-constant luminous densities p*, the con- 
dition (|6]) is satisfied if and only if the gravitational potential 
itself has a power-law dependence on the radius, that is if 



d dlog<E> 
dr dloer 
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It follows that the gravitational potential itself is a power- 
law: 



$(r) = $o — 



(8) 



where 5 — 2/D. Under the present universality hypothesis, 
the exponent D is directly determined by the shape of the 
dark matter potential only, that is, by the constant S. In 
other words, the correlation can be re-written as 



^=8(5 J,) 

ro ^ ' V$o 



(9) 



The symbol 8 indicates a simple coefficient, independent of 
the velocity dispersion parameter, but directly related to 
the value of the exponent 8 and to the properties of the 
distribution function /*. Different distribution functions /* 
can only affect the precise value of the coefficient 8, but 
cannot modify the exponent D. We have thus proved the 
general theorem that for a spherically symmetric, universal 
dark halo, a power-law correlation between half-light radius 
and central velocity dispersion can exist only if the potential 
is a power-law of the radius. 

Of course, the gravitational potential of any dark 
halo is surely more complicated than a simple power-law. 
Nonetheless, many popular dark halo models, such as the 
Navarro-Frenk- White or cored isothermal profiles, are well- 
approximated by power-laws in the inner parts. For dSphs in 
which the luminous material is embedded deep within a dark 
halo, a power-law approximation may work well. Nonethe- 
less, we expect that the power-law correlation {1} will break 
down for objects with larger half-light radii. 



2.2 The Data 



We adopt the dataset reported by IWalker et al.l (|2010h . In 
Table [1] we reproduce the entire dataset, correcting a typo- 
graphical error in the half-light radius of Tucana. 

The major awkwardness in fitting the data is incor- 
porating the observational err ors, which are of com para- 
ble significance on both axes. IWalker et all l|2009cT ) used 
an iterative fitting method that assigns weights accord- 
ing to measurement u ncertainties in both dimensions from 
iRutledee etafl (|l997h . They found log C « -1.5 and D w 5 
(see Walker et al. 2010). One consequence of the statisti- 
cal technique is that the fit is strongly constrained to he 
close to datapoints with small error bars, which in this case 
means, for example, the Sagittarius (Sgr) dSph, amongst 
others. This may be undesirable here, as the Sgr is un- 
dergoing disruption in the Milky Way halo and its halo 
properties may be somewhat different to the bulk of the 
sample. It i s not even clear that the progenitor of the Sgr 
was a dSph |Niederste-Ostholt et al.ll2010l : Penarrubia et all 
|2010D . 

Here, we prefer to use the alternative statistical 
technique of Structural Analysis (jKendall fc Stuartl Il979l : 
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4.3 ± 1.2 


3.3 ±2.1 x 
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140 ± 25 
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590 ± 20 
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Table 1. Half-light radii and central velocity dispersions for 28 
dSphs. References: (1) Irwin & Hatzidimitriou 1995; (2) Walker 
et al. 2009b; (3) Martin et al. 2008; (4) Walker et al. 2007; (5) 
Mateo et al. 2008; (6) Koch et al. 2007; (8) Martin et al. 2007; (9) 
Koch et al. 2009; (10) Simon & Geha 2007; (11) Aden et al. 2009; 
(12) Belorukov et al. 2008; (13) Walker et al. 2009a; (14) Irwin et 
al. 2007; (15) Geha et al. 2009; (16) Belorukov et al. 2009; (17) 
McConnachie & Irwin 2006; (18) Cote et al. 1999; (19) Chapman 
et al. 2005; (20) Ibata et al. 2007; (21) Letarte et al 2009; (22) 
Lewis et al. 2007; (23) Ibata & Irwin 1997; (24) Majewski et al. 
2003; (25) Saviane et al. 1996; (26) Fraternali et al. 2009. 
( a ) Structural parameters refer to the bound central region of Sgr 
(see Majewski et al. 2003). 



Hodg kin et al.ll2009l ). We will perform this analysis in both 
the natural and logarithmic planes. 

2.3 Algorithm 

The method is based on the maximization of the likelihood: 



Yi 



+ 



z% — F(Yi; D, G) 



(10) 



in which {yi, Zi) are the N observed values of the data pairs 
with standard deviations (<J yi , a Zi \ while Zi — F(Yi;D,C) 
is the underlying power-law model, namely eqns (0 or ([2]). 
The essence of the method is that the TV coordinates Yi are 
considered unknown. This is in contrast to the classical x 2 
evaluation, in which the independent variable has no obser- 
vational error and so Yi = yi. For each pair (D,C) of the 
free parameters, the N coordinates Yi are chosen to be those 
that maximize the likelihood. 




Figure 1. Schematic depiction of the division of the logarith- 
mic plane into four quadrants, defined by the observational point 
(Vi, Zi). 



Let us first consider the case in which the fit is per- 
formed in the natural plane, and thus the model function 
F is given by eqn |l|). In this case, the errors (cr yi ,(j Zi ) are 
straightforwardly identified with the observational errors in 
Table [T] and thus the likelihood L is a continuous and dif- 
ferentiate function of the coordinates Yi. As a consequence, 
the following set of iV equations determines the values of Yi 



dlnL 

dY 



V< 



Y, 



dF 
dYi 



Zl -F(Yi-D,C) 







(11) 



For general values of D, eqns (|ll|l cannot be solved analyt- 
ically, but, given the continuity and differentiability of the 
likelihood, they can be easily solved numerically. 

If the fitting is performed in the logarithmic plane, 
then matters are slightly more complex. Here, the errors 
(p Vi , a Zi ) , are asymmetric with respect to the central obser- 
vation point (yi,Zi). The (Yi,Zi) plane is then effectively 
divided in four quadrants, each characterized by a different 
(vyi, o"zi) pair, and for this reason eqns (|lip must be recon- 
sidered. 

Let us consider an observation point (yi, Zi), which nat- 
urally defines four quadrants Qj in the (Yi, Zi) plane, char- 
acterized by well defined pairs of errors (o yi , a 3 z . ) , as shown 
in Fig. [T] Each of these quadrants could in principle be in- 
tersected by the fitting function Zi = F(Yi \ D,C). In our 
case, the fitting function is a straight line, and then for a 
fixed (D, C) pair, only three quadrants will be intersected. 
For each of these three quadrants, we replace eqns pip with 



- inf 

(y,,Zi)e<2 3 



Vi-Yi 



+ 



Zl -F(Y;D,C) 



(12) 

i|12[) allow us to define 
to the likelihood, one 



For the i-th observational point, eqns 
three candidate contributions (lnL)^ 
for each of the three intersected quadrants Qj . These points 
are those that maximize the likelihood in each single inter- 
sected quadrant. Among these possible contributions, only 
the one with the smallest absolute value is chosen, so that: 



mm 

3 



(13) 



This generalization allows us to solve the problem of fitting 
in the logarithmic plane, whilst maintaining intact the basic 
idea of the Structural Analysis method: eqn (|12[) naturally 
reduces to eqn (|11[) when errors become symmetric. 
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Subset 


D 


logC 


X 2 


Entire sample 
Classical dSphs 
Faint dSphs 


o 7+0.78 
°-'-0.55 

3.5+ 2 * 

q q+0.7 
°-°-0.6 


-0.76±°' 7 5 
-0.78±i' 7 
-0.22lg; 7 


76 
10.5 
47 


Entire Sample 
Classical dSphs 
Faint dSphs 


o q+0.9 
d - y -0.6 

q K + 0.85 
°-°-0.6 


-i n4 +0 ' 6 

-0.87li 4 
-0.54lg;I 5 


84 
10 
62 



As a flexible family of dark matter haloes, we use 



Table 2. The values of D and logC together with the x > using 
fitting in the natural plane (upper table) and logarithmic plane 
(lower table) 



2.4 Results 

The fitting analyses have been performed in both the nat- 
ural and logarithmic plane separately for the entire sample 
presented in Table Q] (28 objects). It is also interesting to 
split the sample into the classical dSphs - namely the first 
eight objects in Table [1] - and the next twenty objects - 
which are predominantly the ultrafaint dwarfs. 

For each of the three samples, Table [2] lists the best-fit 
values of the exponent D and the logarithm of the coefficient 
G, together with errors corresponding to the 68% confidence 
regions, and finally the associated \ 2 value. As might have 
been anticipated, the two chosen parameters are highly cor- 
related, which contributes the largish uncertainties on the 
best fit values. Note that the results obtained in the natural 
and in the logarithmic plane can be different. This is not evi- 
dent in the case of the classical dSphs, but this set comprises 
only 8 objects, which is perhaps too limited to constrain the 
parameters of the model. It seems possible that the classical 
dSphs and the ultrafaints are offset in C and D. This means 
that the confidence regions associated with the two subsets 
hardly overlap in the (D, log G) plane. As a consequence, 
the best-fit exponent of the entire sample is not an average 
value between the best fit exponents of the two subsets, but 
it is steeper than both of them. 

It is a reasonable conclusion from Table [2] that claim- 
ing a precise fit to the data in Table 1 makes very little 
sense. First, there is a range of pairs (D, C) that are reason- 
ably compatible with the data. Second, the best fit parame- 
ters still give quite poor fits, especially for the faint dwarfs. 
This is after all what was expected at outset as only a pure 
power-law dark matter potential can give a power-law cor- 
relation. In particular, the exponent of the correlation D 
cannot be constrained any better than saying that it satis- 
fies 3.2 < D < 4.4. With the present data, establishing such 
an interval is a more sensible task than providing a single 
best-fit solution, whatever fitting method has been used. 



3 ISOTHERMAL DISTRIBUTION 
FUNCTIONS 

We can gain some useful insights into the correlation by 
again supposing that all the luminous components of dSphs 
are embedded in exactly the same dark matter halo. The 
correlation then necessarily reflects the properties of this 
unique potential well only. 



p(r) 



Pa 



- 1+ — 



(14) 



Throughout the paper, we will concentrate in particular 
on the following three choices: 




(1,1,2) 
(0,2,3) 
(0,2,4) 



(15) 



The first choice of parameters yields the cosmologically- 
motivated Navarro-Frenk- White (NFW) model. The second 
two choices are standard examples of cored models, one with 
the same asymptotic density fall-off as the NFW profile 
(p ~ r~ 3 ), whilst the second with a faster fall-off (p ~ r~ 4 ). 

With an eye to later developments, it is also helpful to 
introduce a dimensionless half-light radius R\ by scaling the 
true distance by the characteristic radius ro of the dark halo 



R b = Rh/r . 



(16) 



Next, we use the characteristic density po to define the di- 
mensionless velocity dispersions: 



- 2 

a 



a 

5? 



4(r/ro) 



(17) 



where $o is the first non-constant term in the Taylor expan- 
sion of the potential. 

Taking our inspiration from the observ ed flatness of 
dSph velocity dispersio n profiles (see e.g., iKlevna et al.l 
l200ll: IWalker et al.[[2007h . we assume that the stars have an 
isoth ermal Maxwellian distribution function (|Evans et al.l 
120091) : 



E 

f*(E) oc exp I j 



(18) 



where E is the energy and a is a constant. For this choice, 
clearly, any component of the stellar velocity dispersion is 
just the constant a, which is also identical to the projected 
velocity dispersion er p . 

Let us start with an NFW halo, which in its central 
parts, has the gravitational potential 



r(r) = -47TGr 2 p 1 



2r J 



0(r s 



(19) 



thus $0 — 27rGpo r o- 11 th e stellar component is embedded 
deep within the halo, the potential is approximately linear. 
Then 8 = 1 in eqns ([8]) and ((9]), and so the -Rh(<5" p ) corre- 
lation is quadratic (D = 2). Specifically, for an isothermal 
distribution function, we get: 



Rh 



2.027 6p 



(20) 



As an alternative, let us take any cored model. Then, 
the gravitational potential is harmonic in the inner parts of 
the roughly constant density core: 



$corc(r) = -47rGr o p 



1 



+ 0(r 3 



(21) 



so that $0 = 47rGpo?"o/3- As a consequence, the associated 
Rh(d'p) correlation is linear (D = 1), namely 



Rh 



(22) 
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In both cases, the exponent D of the correlation is signif- 
icantly shallower than that inferred from the observational 
data (3.2 < D < 4.4). 

Of course, whilst a power-law correlation is a fair start- 
ing point to perform a fit to the (modest) available data, 
it does not make a great deal of sense when translated in 
the context of models. The direct consequence of this as- 
sumption is the power-law parametrization of the potential 
itself. For systems whose luminous scalelength becomes com- 
parable to the scalelength of the dark halo itself, significant 
deviations from a power-law correlation are expected. We 
need more complex models to give a proper description of 
the data, and it is to this subject that we now turn. 



4 QUASI-ISOTHERMAL DISTRIBUTION 
FUNCTIONS 

4.1 The Half-Light Radius and Central Velocity 
Dispersion 

Henceforth, we use the full potentials associated with the 
dark matter halos (114[1 . and in particular with the three 
profiles given by (|15p . As these tend to zero at large radii 
r 3> ro, we use a lowered isothermal distribution function. 
An isothermal component has a divergent total mass if im- 
mersed in a potential regular at spatial infinity. We cannot 
allow such a behaviour, since then the half-light radius Rh 
is not a well defined quantity. We thus choose the quasi- 
isoth ermal or King distribution function (|Michidll963l ; (Kind 
1966) for the stars: 



P*,o 



[exp(-E K /a)-l] 



(23) 



(27RT 2 ) 3 / 2 

where — E — $(r t ) and r t is the tidal radius0 

The density and velocity dispersion profile generated 
by this distribution function (|23|) are given in terms of the 
underlying gravitational potential as: 

p*(r) = 4-7T / /* v 2 dv 
Jo 



P*,o 



exp (b )erf (6) 



b+-b 3 

/7T V 3 



(24) 



2 Unfortunately, the nomenclature in this area is confusing. Mod- 
els generated by the distribution function H23II arc often called 
"King models", as they were popularised by King (1966). For ex- 
ample, this is the meaning of the term in the standard textbook 
of Binney & Tremaine (2008, pp. 207-311). However, the phrase 
"King model" is also used to describe the empirical fitting formula 
for the density introduced in eq. (14) of King (1962). This causes 
no confusion in applications to self-gravitating star clusters, as 
King's distribution function generates a density profile similar to 
King's empirical law. However, it is confusing for systems such as 
dSphs in which an external gravity field from the dark halo dom- 
inates, a nd the two "King mode ls" are now very different. For 
example, Penar rubia et all (2008) study "King models" embed- 
ded in NFW haloes, and by this they mean the empirical density 
law is placed in an NFW halo. However, there is now no guarantee 
that the stellar velocity dispersion profile of the "King model" is 
flat. In fact, it is clear from their Figure 2 that the line of sight 
velocity dispersion is not flat, but rising or falling in the radial 
range where the stars are mostly probing (—1.5 < log(R/r s ) < 0.5 
using their notation from their figure) . 



(A 



4tt 
P* 

n 1 
3<T 



V2ab 



f-t v 4 dv 



p»(r)/p»,o_ 
in which, for brevity, we have written 

b = [b(r,a,r t )] = . 

a* 

The equation that defines the half-light radius Rh is 



(25) 



(26) 



4tt 



R dR 



P* M r , °: r t)] 



rdr 



Vr 2 - R 2 



= 2ty RdRT, t (R) = ir RdR £„(i?) , 
Jo Jo 

while the projected velocity dispersion is given by 
2 f +ca , , , rdr 



3£*0R) J R 



p*{r){v )» 



R 2 



(27) 



(28) 



Our models are determined by four dimensional scales: 
given a (n, er) pair for the luminous component and a (ro, po) 
pair for the dark halo potential $, the properties of the mod- 
els are completely fixed. As the luminous components are 
tracers, the value of the parameter p*,o, which fixes the cen- 
tral stellar density, does not need to be specified. It does not 
affect either the half-light radius Rh or the projected central 
velocity dispersion <j p ,o. If we instead use dimensionless pa- 
rameters [see eqns (|16[) and (|17[) ]. then we can complete the 
set by introducing, together with <r, the dimensionless tidal 
radius ft = n/ro, which is the true tidal radius divided by 
the characteristic scale of the dark halo. 



4.2 Theoretical Correlation Curves 

Our aim is to describe the behaviour of the half-light radius 
Rh in the dimensionless parameter space (f t ,<r) which char- 
acterizes our models. Some of the asymptotic properties are 
analytic, as discussed in Appendix A. 

In order to obtain a unique Rh(&p,o) correlation, we 
need to fix the second free parameter ft . For reasons that will 
shortly become apparent, this we do by setting the structural 
parameter 



n/Rh = f t /Rh 



(29) 



to be a constant. That is, we construct our R\(a Pt o) correla- 
tion by following one of the contours of k in the (ft, a) plane 
(see for example Fig. Al) and characterise the correlations so 
obtained by i?£(<r Pi o). By consideri ng the photometric pro- 
files o f the eight classical dwarfs in llrwin fc Hatzidimitrioul 
i|l995h . we can grasp a lower limit for k. The tidal radius r t 
must be greater than the final radius Hast which has a signif- 
icant non-zero photometric measure. From this, we conclude 
that all the classical dSphs have k>4. A somewhat crude, 
but nevertheless useful, upper value for k can be given by 
arguing that it is highly improbable the tidal radius of a 
dSph is larger than a quarter or a fifth of its distance from 
the centre of the Galaxy, giving us the constraint k -C 100. 

The eight classical dSphs also show projected velocity 
dispersion profi les which are flat ou t to the last measured 
points (see e.g.. Walker et al]|2009ch . For Draco, this means 
out to approximately nine half-light radii; for Carina, Leo I 
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Figure 2. The dimcnsionless half-light radius Ri r is plotted against the dimensionless projected central velocity dispersion 6" p ,o- Upper 
left: The theoretical relation i?^ =10 with the two endpoints corresponding to 7 = 4 and 7 = 8 for an NFW profile. The red line shows 
the analytic and asymptotic result (1 20 P . Upper right: The curves ijjj =1 ° and _R£ =100 corresponding to 7 = 6 for an NFW profile. Lower 
left: The curves Rh(°p,o) for the cored dark matter density distribution Q14I with (a = 0, b = 2, c = 3). The red line shows the analytic and 
asymptotic result 1221) . Lower right: The curves -Rh(<?p,o) f° r the cored dark matter density distribution ( 1 141) with (a = 0, 6 = 2, c = 4). 
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Figure 3. The 68% and 95% confidence regions of the likelihood (1 3 1 I t respectively for the dark matter halo profiles. Full dots indicate 
the best fit models. The red shaded areas in the (ro,po) plane are forbidden by the constraints l|33|l and (J34J . Here, ro is measured in 
pc, and po m Mqpc -3 . 



and Sculptor, out to approximately five. As a consequence, 
we restrict attention to models which satisfy the condition 



T P {lRh) > 0.9 cr p (0). 



(30) 



This ensures that the profile is flattish out to 7 multiples of 
the half-light radius. 

For a fixed value of n, the effect of imposing the con- 
dition (|30|) is to define the model with the highest possible 
<7 P and R^- This model represents the endpoint of the cor- 
responding -Rh(5" Pj o) relation. Models beyond the endpoint 
are unacceptable as their velocity dispersion profiles do not 
resemble those of the dSphs. In other words, the higher the 



value of 7, the shorter the theoretical relation R^(a p fi). For 
a fixed value of k, the curves obtained for large values of 7 
are exactly contained within those of smaller values of 7. For 
example, the upper left panel of Fig. [2] shows the relation 
obtained embedding our models with fixed k = 10 within 
a NFW halo; the two different displayed endpoints corre- 
spond to 7 = 8 and 7 = 4. We note that the curves follow 
the asymptotic quadratic relation (|20[) . which is valid in the 
regime in which <t Pi o <C 1. 

The upper right panel of Fig.[5]shows the effects of vary- 
ing the value of the parameter k. The theoretical relations 
corresponding to n — 10 and k = 100 are plotted together. 
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Entire sample 
Classical dSphs 
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u,o -0.09 
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ii-2±lJ 


143 
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Table 3. Results of the fitting analyses under the universal halo 
hypothesis. The analysis has been performed separately for the 
eight classical dSphs and for the entire sample (28 dSphs). The 
coding referred to the dark matter profile reads as in eqn (TS) : 
(1) NFW profile; (2) Cored profile with p ~ r -3 as r —y oo; (3) 
Cored profile with p ~ r — 4 as r — > oo. 



The displayed endpoints are fixed in both cases by 7 = 6. 
The K = 100 relation extends to higher values of a and A£. 
In this case, the inclusion between the two relations is not 
exact. However, in their common domain, the two relations 
are still roughly identical, given that the observational errors 
on the available <r p ,o and Rh data-points are quite large. 

The properties of the Ah(o~ P ,o) relations produced by 
our models allow us to impose a useful simplification. We 
need consider only the Ah(5" P ,o) relation produced by the 
highest plausible value of k and lowest plausible value of 7. 
Provided we overestimate k and underestimate 7, we can 
be sure that, besides small deviations, all possible J?h(o" P ,o) 
curves obtained for other realistic («, 7) pairs are in fact con- 
tained in ours. As a consequence, we define our final Ah(o" P ,o) 
relation as that obtained by the pair (k, 7) = (100, 6). With 
this set, the reader may worry that we are overestimating 
7 and hence overlooking some viable models. However, we 
have shown that the effect on the i?£ (&p,o) relation of a lower 
7 is similar to that of a higher k. Since we are surely overesti- 
mating ft, we are not losing models at the end of the Ah(a p ,o) 
relation. The upper right panel of Fig. [2] therefore shows our 
final i?h(<5"p,o) relation for the NFW dark halo. Its endpoint 
for 7 = 6 corresponds to (<5- p ,max, Ah, max) ~ (0.35, 1.2). 

The lower panels of Fig. [2] show the final Ah(o" P ,o) 
relations obtained for the two cored profiles of eqn ()14[) 
with (a = 0, b = 2, c = 3) and (a = 0, b = 2, c = 4) re- 
spectively. To recapitulate, the former has a density fall- 
off like r -3 at large radii, the latter like r -4 . Their end- 
points are respectively at (<T p , max , Ah, ma x) ~ (0.66, 1.5) and 
(<5" p ,max, Ah, max) ~ (0.54, 1.65). In both panels the red pro- 
files show the asymptotic relation for cored models (|22[) . 
It is clear that the extrapolation of such an approximation 
out of the regime <r p ,o <C 1 is unreliable. For larger veloc- 
ity dispersions, the true Ah(<5" p ,o) has, in all cases, a much 
steeper dependence on <r p ,o than the power-law correlation 
obtained in the asymptotic limit. Finally, even if the qualita- 
tive shape of the three different Ah(<5" Pi o) relations obtained 
for the three dark matter profiles of eqn (|15p remains sim- 
ilar, when looked in detail, they show several quantitative 
differences. 
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Figure 4. The po(ro) functional dependence generated by differ- 
ent halo profiles using Fornax as an illustrative example. Here, ro 
is measured in pc, and po m Mqpc - 3 . For each (ro, Po( r o))i the 
Fornax half-light radius and central velocity dispersion is mapped 
onto the underlying theoretical -Rh(°"p,o) curve. The endpoints of 
the relations are marked with filled circles. (Full line, dashed line 
and dash-dotted line represent respectively the dark matter den- 
sity profiles as ordered in eqn. 1151 1.) For comparison, the red 
shaded area indicates the expectations of a ACDM cosmological 
model (see text for further description). 



5 THE UNIVERSAL HALO HYPOTHESIS 

For the moment, we continue to assume that the luminous 
components of all the dSphs are embedded in exactly the 
same universal dark halo, which in turn has one of the three 
density profiles of eqns (| 14p and (|15[1 . By fitting the data- 
points in Table[T]to the respective Ah(<5" p ,o) relation, we can 
both measure the best characteristic central density po and 
radius ro associated with each profile and compare the qual- 
ity of the three fits to gain insight into the closest match to 
actual dark halos, at least under the hypothesis of universal- 
ity. We use the technique of Structural Analysis introduced 
in Section \2. 31 The likelihood we maximize is 

(31) 

in which (j/;, z;) are the N observed values of the data pairs 
with standard deviations (cr Vi , cr Zi ), and Ah(Vi) is the theo- 
retical Zi = Ah(<5" P ,o) relation generated by the halo model, 
whilst $o has been defined in Section 3. The iV equations 
which determine the values of Yi are 



<91nL r^—Vi 
= V$o — 



dYi 



YiV$o 



+ ro- 



dR h 
dYi 



Zj — rpR^Yj) 



= 



(32) 



which can be easily solved numerically. 

Note that each of the theoretical Ah(o- p ,o) relations 
comes with its own bounded domain of validity (<7 p ,o G 

[0,CT p ,max], Ah G [0, Ah.max] = [0, A h (<7p,max)] ) • This implies 

the existence of constraints for the possible physical param- 
eters, po and ro. First of all, considering the Sgr dSph which 
has the largest half-light radius in Table [T] then we must 
have 



roRh,i 



1550pc. 



(33) 



This must be so, otherwise the objects with the largest 
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half-light radii may not be compatible with the theoretical 
Ah(5" P ,o) correlation in the universal approach. In the same 
way, by taking Tucana dSph which has the largest central 
projected velocity dispersion of cr p ,o = 15.8 kms -1 , we ob- 
tain 



> 15.8kms~ 



(34) 



Note that 
straints (13311 



the detailed values provided by the con- 
_ and (|34p depend on the dark matter profile un- 
der consideration, which fixes the endpoint (o" p , m ax, Ah, max) 
of the curve -Rh(<5" Pi o). This suggests that the best set of 
parameters in which to perform the numerical maximiza- 
tion of the likelihood ((5T|) is [l/r , l/-\/5o]- We can easily 
transform our final results into the natural plane (ro, po), as 
shown in Fig. [3] for the three dark matter profiles. The 68% 
and 95% confidence regions according to the likelihood are 
shown. Note that the constraints (|33[) and (|34[) have been 
used in displaying the (ro,po) plane: in each panel, the red 
shaded area is forbidden. Table [3] lists the best-fit models, 
which are indicated with full dots in Fig. [3l together with 
the associated \ 2 value. Also listed are the results of similar 
fitting analysis performed on the classical dSphs only - that 
is, the first eight dSphs in Table [T] In this case, the dSphs 
with the largest Rh and <t Pi o are respectively Sextans and 
Fornax. 

The results of this analysis are not particularly encour- 
aging for the universal halo hypothesis. Even if the NFW 
profile seems to provide the best fit to the available data, 
the resulting physical parameters (ro f ,Po f ) do not make 
great sense, at least as judged from the expectations of nu- 
merical simulations of dwarf galaxy formation. The best fit 
characteristic radius is in fact extremely large, namely 4.9 
kpc, while the characteristic density (to balance the extreme 
value of Tq C ) is very low, corresponding to a concentration 
parameter as small as c ~ 6.5. If the analysis is restricted to 
the classical dSphs, the fits are better, suggesting that the 
universal approach is less inadequate. 



6 SINGLE OBJECT ANALYSES 
6.1 The Method 

As an illustrative example, let us consider the Fornax dat- 
apoint, (<t Pi o, i?h)por, given in Table [1] The problem is to 
choose the scaling (ro, po) that maps this dimensional data- 
point to models on the underlying dimensionless theoretical 
curve -Rh(o" P ,o) for a given dark halo profile. 

As we have seen, any halo profile is naturally associ- 
ated with a specific domain [0, <T Pjmax ]. For each <t Pi o in this 
interval, one and only one (ro, po) pair exists such that 



(o-p,o, -Rh)For = (& P ,oV$o > r R h (a P:0 )). 



(35) 



We can thus define a one-to-one relation between the two 
characteristic scales po and ro, that we simply indicate by 
Po()"o), or equivalently by ro(po). Each of the (ro, po()"o)) 
points corresponds to a different position of the Fornax dat- 
apoint on the theoretical -Rh(o" P ,o) relation. 

Fig- SI for example, displays the three po(»"o) functional 
dependences generated by our three canonical dark matter 
profiles, together with the respective endpoints, for the case 
of Fornax. Clearly, the dependence changes for each of the 
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Table 4. Summary of the properties of the probability distribu- 
tions obtained for the characteristic radii ro(c = c\) and ro(c = 
e u ), together with the maximum concentration parameter c ma x- 
We recall that c u = min(30, c max ) and that c\ = min(15, 2c u /3). 



three different dark matter profiles. Furthermore, the exis- 
tence of an endpoint model (a Pimax , -Rh(<5" P ,max)) implies con- 
straints for ro and po, which respectively have a minimum 
and maximum available value for each dSphs. For future use, 
we denote the minimum characteristic radius by ro, m in and 
the associated maximum value of the characteristic density 
by po,max = po(?"o.mm). Notice also that, for the specific case 
of the NFW profile, the existence of such a functional depen- 
dence naturally corresponds to the existence of an analogous 
link between the characteristic radius ro and the concentra- 
tion parameter c, which, for clarity, we recall to be defined 
by 



Po(c) 



600# ( 2 



(36) 



8ttG ln(l + c) - c/(1 + c) ' 

in which Ho is the Hubble constant (|Navarro et al ][l9<i). As 
a consequence, for each dSph, the maximum characteristic 
density po.max also corresponds to a maximum compatible 
value of the concentration, c m ax- 

It is useful to campare to the predictions of numerical 
simulations of halo formation in ACDM. So, Fig. [4] also dis- 
plays the translation in the (po,f"o) p lane of the c v i r (M v i r ) 
functi onal relationships ca lculated in iBullock et al. I (|200l| ) 
and in lKuhlen et al.l l|2005h (see especially their Fig. 9). For 
the specific case of Fornax, this allows us to compare the 
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Figure 5. Mass profiles M(r) for Fornax assuming an NFW halo (upper left) and cored haloes (upper right and lower left). The curves 
correspond to different values of the halo scalelength ro, in particular 1 (in red), 2, 4, 8 and 15 (in black) times the minimum characteristic 
radius ro, m i n associated with each dark matter profile. The lower right panel is a superposition of all the mass profiles in the preceding 
panels and shows the existence of a special radius at which - despite our ignorance of best choice of model or scalelength - the uncertainty 
in the enclosed mass is minimised. 



expectations of a ACDM cosmological model with our re- 
sults. It is interesting to note that the models selected by 
the ACDM scenario for the Fornax dSph are those at the 
highest end of the po(ro) relation, and that thus have a lu- 
minous and a dark component with similar scale lengths 
(-R h w 1). 

Now let us consider the halo enclosed mass: 



M(p ,r ,r) 



47rx 2 p(po,ro, x) dx. 



(37) 



The existence of the functional relation po(ro) allows us 
to reduce the dimensionality of the free parameters of 
the halo mass functions by one, so that M(po,ro,r) = 
M(po(ro), ro, r) = M(ro,r). Thus, to produce a mass mea- 
sure for each of the dSphs, we need only a dark matter profile 
and a characteristic radius. 

However, even given our ignorance of the correct choice 
of dark matter halo density profile and characteristic radius 
ro, we can still provide a reliable mass measure for each 
of the dwarfs by using only the single (a Pt o,Rh) datapoint. 
This unexpected result is illustrated explicitly in the case of 
Fornax by Fig. [5] This shows the enclosed halo mass (|37[) for 
different plausible choices of ro and for our three different 
dark halo density laws (|15[) . 

It is clear that for each density profile there is a special 
radius r spcc , inside which the uncertainty of the mass mea- 
sure caused by our ignorance of the characteristic radius ro 
is minimized. If we know the dark matter density law - for 
example from cosmological considerations - then the error 



that we make in measuring the enclosed mass in the absence 
of accurate information on the characteristic radius is barely 
is 10%. Surprisingly, the locations of the special radii for the 
different halo profiles are close to each other, and amount 



to on average a value of r s 



1.7i?h- This result derives 



directly from the shapes of the -Rh(S" Pi o) functional relations 
defined by our lowered isothermal models with flatfish pro- 
jected velocity dispersion. 

Although we have shown the results of our calculation 
for Fornax only, we have carried out similar computations 
for all the dSphs, which leads to the conclusion that a uni- 
formly good choice for the radius r spcc is r « 1.7Rh- (For 
the sceptical reader, this result can also be inferred from our 
later Fig. 0) 

This result has some superficially similar analogues in 
the recent literature. A number of authors have looked at 
classes of anisotropic models from a Jeans equation per- 
spective and argued that the mass within the half-light 
radius is robust agains t changes in the ha l o model and 
the anisotropy (see e.g . , IStrigari et al.ll2007al ; I Walker et all 
l2009d ; IWolf' et al.|[2010h . The Jeans equations of course are a 
weak constraint, and there is an enormous freedom in solving 
for the enclosed mass in terms of multi-parameter models of 
the light and anisotropy profiles. Very often, the Jeans so- 
lutions are meaningless in that, although the stresses are 
positive, there is no physical distribution. A classic example 
is provided by cored light profiles in cusped dark halo pro- 
files. The Jeans solution exists and is physical - but there 
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Figure 6. For each dSph, the upper panel displays the probability distributions for the halo mass inside 1.7i?h, while the lower panel the 
distribution for the characteristic halo radius ro- Black and red represent the lower and upper concentration parameter NFW models. 
Masses are in units of !O 7 Af0, while radii are measured in kpc. 
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Figure 7. Comparison with previous mass measures, all radial coordinates are scaled to the half-light radius, while masses are measured 
in Mq. The shaded regions represent respectively: NFW halos with characteristic radii from ro, m i n to 5ro, m i n in gray; cored halo with 
fall-off p ~ r -3 with characteristic radii from ro jm m to 5ro jIn i n in red; cored halo with fall-off p ~ r~ 4 with characteristic radii from 
''O.min to 5ro, m i n in green. The empty triangle indicates our mass measure inside 1.7-Rh with error from the observational uncertainties. 
The blue filled circle is the measure of the mass enclosed in from Walker et al. (2010). Black dots are measures from Strigari et al. 
(2007) and (2008). The additional empty circle for Segue f is a measure from Geha et al. (2009). 
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is no distribution function l|An fc Evansl 120091 ). Hence, this 
approach leads to weak constraints as a wide class of mod- 
els, many of which are unphysical - being considered. Once 
this has been appreciated, it is less surprising that quasi- 
isothermal phase-space models allow us to fix the mass to 
much greater radii. 

Hence, the Jeans based analyses are studying how the 
mass estimates are affected by anisotropy. They show that 
the mass within about a scale radius is largely unaffected 
by assumptions as to anisotropy. Our starting point here is 
different. We are arguing that the central parts of the dSphs 
are surely isotropic and are happy to build that hypothesis 
into our models. Our result is that by varying the halo pro- 
files, the mass within 1.7 half-light radii is largely unaffected 
by changes in the mass profile. 

6.2 The Masses of the dSphs 

In fact, the effect of the observational errors on the mass 
measure inside r apcc is larger than any differences caused 
by changing the halo models. To quantify this, we use a 
Monte Carlo numerical method. For each dSph, we study the 
distribution of the mass measures generated by assuming a 
normal distribution for the observational (<r Pi o,Rji) points 
with means and standard deviations as in Table Q] Each 
random point extracted from these probability distributions 
generates a po(fo) relation, as in eqn (|35[) , and hence a mass 
measure. We perform this analysis only for the NFW halo 
on the grounds of its cosmological importance. The value of 
ro used to produce each mass measure is calculated by fixing 
the concentration c to a lower c\ and an upper c u value. 

Numerical simulations suggest that dSphs have dark 
hal oes of NFW form w ith concentration values 20 < c < 
30 (|Navarro et al.lll996T ). which we use as lower and upper 
bounds in the following analysis. However, we have already 
shown that there is a maximum value of the concentration 
Cmax for each dSph consistent with it falling on the theoret- 
ical curves -Rh(o" P ,o). For some of the dSphs, c max is already 
less than 30 - namely, in the cases of Sextans (c max w 23) , C 
Ven I (c max w 26), Hercules (c max 22), And II (c max w 16), 
And IX (c max w 25), and Sagittarius (c max w 16). For these 
dSphs, we use 2c max /3 and c max as our lower and upper 
bounds. 

The results are displayed in Fig. [6] which show the prob- 
ability distributions for the enclosed masses M(1.7Rh) and 
the characteristic scalelengths ro of the NFW halo. Note 
that, even though the two different assumed concentrations 
imply rather different distributions for the scalelengths, this 
has almost no effect on the mass measures because of our 
choice of r spcc = 1.7-Rh- Only the objects with the smallest 
relative errors in the observational data, such as Fornax, C 
Ven I and Sgr, have recognizable differences in the modes 
of the two mass distributions, although these remain small 
compared to the intrinsic spread of the distributions. 

The shape of the mass distributions in Fig. [6] depends 
critically on the quality of the observational data for the 
central velocity dispersion ct p .o or half-light radius R^. If 
the uncertainties are small, then the probability distribu- 
tion is a well-behaved, nearly normal function. We extract 
the mode or most likely value for the mass, together with 
the values at half maximum, which gives the the range (re- 
call that for a Gaussian, the full-width at half maximum is 



~ 2.35<r). However, if the observational datapoints are poor, 
the probability distribution acquires a highly skewed struc- 
ture. The objects that suffer from this difficulty are Bootes 
2, Leo IV, Leo V, Segue 2 and And XV. Here, the peak of the 
distribution occurs at mass scales too small to be clearly re- 
solved, and we can give only the mass at half the maximum 
as an upper limit. A similar, but less dramatic problem, is 
encountered in Willman I, and And IX. The mass estimates 
and range are listed in the first column in Table [S] 

The shape of the distributions for the characteristic 
scalelength ro of the NFW haloes in Fig. [S] are well-behaved, 
although they do depend on the assumed concentration pa- 
rameter. For the upper and lower values for the concentra- 
tion, the mode or most likely value for ro, together with 
the values at half maximum are listed in Table [4] Using the 
(fi-p.o, Rh) datapoint only, we are unable to determine ro with 
any confidence. To achieve this, we would need further obser- 
vational information, particularly the detailed photometric 
and kinematic profiles, which are only available for a handful 
of the brightest dSphs. Nevertheless, we can still deduce an 
interesting conclusion. The listed maximum concentration 
parameters c max range from values as small as 16 for Sgr and 
And II to very high values (over 150) for the faintest objects 
like Segue 1, Bootes 2 and Willman 1. The nearer c max is to 
the values of the concentration suggested by the cosmolog- 
ical simulations, the nearer the values inferred for the two 
characteristic radii ro(c = c\) and ro(c = c u ) are to the value 
of the observational half-light radius .Rh. However, a number 
of predominantly the fainter dSphs have a much higher c max 
(seven have c max > 70, for example). Such dSphs, then, ei- 
ther they have a central density which is significantly higher 
than suggested by cosmological N-body simulations (see up- 
per panel in Fig. [8j , or they have a ro/ -Rh ratio which is much 
higher than unity. The only way to construct a model with 
a concentration in tune with cosmological predictions for is 
in fact to pick a larger scale radius ro for the halo. If we 
thus believe in the indications given by the simulations, we 
must also accept the existence of a systematic inverse trend 
between luminosity and the ratio ro/Rh (see lower panel in 
Fig. [8j. In other words, the fainter luminosity dSphs must 
be embedded more deeply within their dark matter haloes. 

Also given in Table [5] are the values M~ and M + , 
which respectively indicate the lowest and the highest mass 
measure inside 1.7-Rh among all the models considered in 
Fig. [5] The spread M + — M~ thus approximately represents 
the uncertainty due to the models only. Since the observa- 
tional uncertainties are uniformly larger, it is evident that 
our method of measuring halo masses can provide useful in- 
formation simply by improving the quality of the data in 
Table Q] before embarking on the laborious process of ob- 
taining detailed photometric and kinematic profiles for each 
of the 28 dwarfs. Using the luminosities listed in Walker et 
al. (2009c), mass-to-light ratios are calculated accordingly. 
For most of the dSphs, these measures extend out to much 
larger radii than previously. 

Table [5] also lists a series of previous mass measure- 
ments, namely the measures of the mass enclosed in Rh 
from Walker et al. (2010) and the measures of the mass 
inside lOOpc, 300pc and 600pc from Strigari et al. (2007, 
2008) A comparison between our results and the listed ones 
is displayed in Fig. [7] Note that the spread of the mass 
functions displayed in Fig. [7] only show the deviations of the 
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Table 5. Results for mass measures and mass-to-light ratios obtained in the present paper, together with mass measures from [1] Strigari 
et al. (2007), [2] Strigari et al. (2008), and [3] Walker et al. (2010). 



mass measures due to our uncertainty on the correct dark 
matter density profile and characteristic radius. Again, the 
importance of the choice r spcc = 1.7 Rh is self-evident. 

6.2.1 The Common Mass Scale - Hercules and Leo IV 

I Strigari et al.l l|2008l 'l claimed that all the dSphs shared a 
common mass scale of ~ 1O 7 M within 300 pc. It is worth 
noting at outset that, for the faintest objects like Willman 
1 and Segue 1, 300 pc corresponds to nearly 10 half-light 
radii. The available data is limited to the very central parts 
of the putative dark halos, and the result is based on a stu- 
pendously bold extrapolation. 

Nonetheless, it is interesti ng to see if there are any ob- 
jects for which the assertion of lStrigari et al.l (|200&t ) may be 
disproved. Fig. [7] shows that even within the uncertainties, 
Hercules and Leo IV are not compatible with a common 
mass scale of ~ 1O 7 M . For Hercules, this is because the 
central ve locity dispersion wa s revised downward to 3.7±0.9 
kms" 1 bv lAden et all (|2009al ). who used Stromgren photom- 
etry to discriminate between foreground Milky Way dwarf 
stars and Hercules giants. This com pares with the v alue o f 
5.1 ± 0.9 kms" 1 originally given bv lsimon fc Gehal |2007t ). 
Even allowing for generous model uncertainties, Fig.[7]shows 
the mass of Hercules lies below 3 x 1O 6 M at 300 pc 
(~ 0.9i?h). This agrees with the conclusion of lAden et al.l 



(2009). Another counter-example is provided by Leo IV. 
Here, 300 pc corresponds to ~ 2.6i?h, and Fig.[7]shows that 
the interior mass is again at most 3 x 1O 6 M . 

We con cl ude that the common mass-scale of 
IStrigari et al.l (|200ot ) is illusory. There are indeed dSphs 
for which the mass within 300 pc is ~ lO 7 Af , such as 
Draco, U Mi and Fornax. There are also objects with 
such small half-light radii, like Segue 1 and Willman 1, 
that the uncertainties of the mass extrapolation to 300 
pc are large. Consequently, they may be accommodated 
within a halo of mass ~ 1O 8 M , although much smaller 
masses are probable. However, there are some objects with 
intermediate half-light radii, like Hercules and Leo IV for 
which the common mass-scale clearly fails. 

It is worth noting that the agreement betwen our masses 
and earlier investigators is often good. Where results do dif- 
fer - as for example in Hercules or Leo IV - it is often 
because velocity disperion measurements have been revised. 
Such revisions are always downwards, as contaminants, bi- 
nary stars and variables always introduce additional scatter 
if not properly accounted for. 

6.2.2 Global Correlations 

We are able no w to refine the p l ot bylMateo I (| 19981 ) , also ex- 
tended later bv lGilmore et all |2007l ). In both these works, 
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Figure 9. Correlations between global properties (luminosity, mass and mass-to-light ratios) of the dSphs as inferred in the present 
paper. Red indicates the dSphs associated with Andromeda. 
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Figure 8. Upper panel: The correlation between luminosity 
and concentration parameter necessary to accommodate a NFW 
model with ro/R h RJ 1 for all dSphs. Lower panel: the correlation 
between the depth of the embedding within the dark halo (ro/Rh) 
and luminosity necessary to accommodate a NFW model with a 
similar value of the concentration (c 20) for all dSphs. 



the evident correlation between absolute magnitude and 
mass-to-light ratio for dSphs was interpreted as the the ex- 
istence of a common mass scale, and, in particular, as an ap- 
parent minimum dark halo mass, within th e optical galaxy, 
of the order of 1O 7 M l|Gilmore et al.ll2007l ). We can now ex- 
tend the plot by two orders of magnitude in luminosity, and 
by more than one order of magnitude in mass-to-light ratio 
(inside 1.7i?h)- Even though the faint end is naturally char- 
acterized by larger uncertainties, it is striking that a global 
correlation between luminosity and mass-to-light ratio is still 
strongly evident in our entire sample (see the central panel 
in Fig. [9]). We recall also that our sample contains together 
dSphs that orbit both the Milky Way and Andromeda, but 
we do not notice any strong systematic difference in the plots 
of Fig. [9] between the two populations. We cannot interpret 
the existence of such a correlation as the evidence of a com- 
mon mass scale for the dSphs any longer, as we discarded 
the hypothesis in the previous section (see 16. 2. 1]) . 



Furthermore, it is interesting to look at the correlations 
between mass and mass-to-light ratio, as well as between 
luminosity and mass (see the left and right panels in Fig. 0. 
In both these planes, our 28 objects sample shows e vidence 
for a correlation, although that considered first by iMateol 
(|l998t ) seems to be the one with the smallest scatter. 

It is curious that the correlation between mass and 
mass-to-light ratio is in fact the one with the largest scat- 
ter. This is slightly unexpected, since the mass and mass- 
to-light ratio plane is the one in which the use of the same 
mass diagnostic technique for all the objects is likely to have 
introduced the largest (regularizing) effect. By contrast, in 
the other two planes, the luminosity comes just as a label for 
the dSphs, which is completely external to the phace space 
modelling. This may suggest that the formations phases of 
the dSphs are characterized by a physical mechanism that 
links directly the mass size of their dark matter halo, and 
then potential well, to the extent and properties of the star 
formation history they have undergone. 

6.2.3 Segue 1 

Segue 1 has received considerable attention recently as 
one of the most promising targets for indirect detection of 
dark matter using the gam ma ray signal gene rated by self- 
annihilation of neutralinos l|Scott et alj |2010T >. This is be- 
cause Segue 1 is relati vely nearby (~ 2 3 kpc) and has a 
high Galactic latitude. iGeha et al.l (|2009l ) measured an in- 
ternal velocity dispersion of 4.3± 1.2 kms -1 , which led them 
to claim an enormous mass-to-light ratio within 50 pc of 
~ 1320. Our analysis is consistent with this re sult. Although 
the masses a re comparable to the results of IStrigari et al.l 
(|2007bl . I2008T ). there are reasons to be cautious. 

The case that Segue 1 is a dark-matter dominated dwarf 
g alaxy has been seriously under mined by the recent work 
of iNiederste-Ostholt et all (|2009T i. who showed that there is 
strong evidence from the Sloan Digital Sky Survey photom- 
etry for tidal effects in the outer parts of the object. This 
is hard to explain if indeed Segue 1 is embedded within a 
dark halo with mass ~ 10 6 , as its tidal radius would then 
be m uch larger than its half-light ra dius. Still more worry- 
ingly, INiederste-Ostholt e t al. (20QSJ) raised the possibility 
of contamination in the IGeha et al.l (|2009l ) sample, which 
may cause an artificial inflation of the central velocity dis- 
persion. Segue 1 is in a confused area of the sky, close to 
both leading an d trailing wraps of the S gr str eam and the 
Orphan stream. INiederste-Ostholt et al.l l|2009l ) showed that 
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Figure 10. The coefficient K (l.7Rh, R^), as defined in eqn \38f. 
displayed as a function of the ratio R^ for different families of 
phase space models. Black lines represent models with fiattish 
velocity dispersion profiles and with either a cored or a cusped 
halo that can be exponentially truncated to mimic tidal strip- 
ping (red dashed lines). More specifically, the models all satisify 
eqn J30II with 7 = 6. By contrast, the grey lines represent instead 
analogous - cored and cusped - models with a less restrictive 
requirement on the degree of flatness of the velocity dispersion 
profile (7 = 2). 



there are are kinematic signatures - which they identified 
with the trailing arm of Sgr - close to the heliocentric ve- 
locity of Segue 1. As these stars are also located at roughly 
the same distance and have roughly the same metallicity as 
Segue 1, it is difficult to distinguish them from true Segue 1 
stars, rendering contamination all but inevitable. The avail- 
able evidence is consistent with the interpretation that Segue 
1 is a star cluster, originally from the Sgr, and now dissolv- 
ing in the Milky Way, rather than a dark matter dominated 
dSph. 



6.2.4 Sgr 

The Sgr dSph has recen tly been reassembled by 
iNiederste-Qstholt et all l|2010l) , who reckoned that the lu- 
minosity in the core is ~ (3 — 4) x 10 7 L©. They also esti- 
mated that the total luminosity of the progenitor (that is, 
the core plus the stellar debris in the leading and trailing 
tidal tails) is ~ (10— 13) x 10 7 Lq, making it comparable to, 
though slightly fainter in brightness than, the Small Magel- 
lanic Cloud. They estimate that the total mass of the Sgr's 
dark matter halo prior to tidal disruption was ~ 1O 1O M . 

Now, the mass given in Table [5] pertains only to the 
inner parts of the Sgr core, namely within 1.7-Ri, ~ 2.6 kpc. 
If we extend our NFW model out to ~ 10 kpc to include 
all the Sgr core (see e.g., Figure 10 of Niederste-Ostholt 
et al. 2010) then the associated dark halo mass is ~ (8 — 
15) x 10 s A/0. Although the mass-to-light of the inner core 
is 25 (see Tableland Majewski et al. 2003), the mass-to- 
light ratio in the Sgr tidal streams is in excess of 100 in solar 
units. 



7 A SIMPLE MASS ESTIMATOR 

It is helpful to summarise our results in a simple approxi- 
mating expression which links the halo mass M(1.7Rh) to 
the available observation al data (<7 p ,o,flji) (c.f.. Illlingworthl 
1 19761 ; I Walker et alJlioiol ). The results listed in Table [5] lead 
to: 



M(1.7R h ) = K 



G 



(5.8 ±1 



G 



(38) 



The scatter in the value of the coefficient K is caused by 
at least two different effects. First, the value of the concen- 
tration chosen to construct Table [5] correspond, for differ- 
ent dSphs, to different ratios ro/Rh- However, as shown in 
Figs. [S] and [7] these differences generate rather small differ- 
ences in the enclosed mass M(1.7Rh). Secondly, the shape 
of the probability distributions in Fig. [5] is influenced by 
the importance of the observational errors; in particular the 
peak of the complete probability distribution does not al- 
ways coincide with the value of the mass we would obtain 
by simply ignoring the observational uncertainties. 

A slightly different version of eqn. 1|38[) can be obtained 
by analytic methods. Let us consider the halo mass function, 
which can be written as, 



M(r , 



Po(ro)rl g(r/r ) 



(39) 



and is valid for any choice of the halo density profile. We 
take the limit Rh/ro — > 0, appropriate for the case in which 
the stars are deeply embedded in the halo. Let us suppose 
that the central asymptotic behaviour of the halo density is 
p oc (ro/r) 2 ~ s , so that the gravitational potential in the cen- 
tral regions is simply $ oc (r/ro) S ■ We obtain g oc (r/ro) 1+S , 
while - using eqn <(9j - for the dimensionless half light radius 
we have i?h oc q . Through eqn. (|35p . this determines the 
asymptotic functional relation po(ro) between the inferred 
central density of the halo as a function of its characteris- 
tic scalelength : po oc r^~ 2 . By combining the information 
gathered so far in eqn. (|39|l . we get for the NFW profile 



lim Mnfw ! 

fl h /r -+0 



r ,1.7R h ) 



5.78 



G 
G ' 



(40) 



while, for any cored model, 

lim Mcore (ro, 1.7i4) 5 
R h /r o ^0 



K CTp 



6.8 



G 



(41) 



In terms of our phase space models, the coefficient K 
is clearly a function of the radius within which the mass 
is calculated (typically 1.7i?h in this paper) as well as the 
ratio Rh/ro = Rh for the specific halo model. In particular, 
we have 



K(R h ,1.7R h ) 



M(R h , 1.7i? h ) 



(42) 



r oPo{r ) 

Now, requiring an approximate flatness for the projected ve- 
locity dispersion profile through eqn. (|30fl , corresponds to se- 
lecting families of models whose coefficient K(Rh, 1.7i?h) is 
almost constant in R^, no matter what the precise dark mat- 
ter density distribution is. This is easy to see in Figure [lOl 
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which displays the coefficient K^R^, 1.7i?h) as a function of 
i?h for different families of models. The black lines and the 
associated transparent area represent the families of models 
used in this paper. They satisfy eqn. (|30[) out to 6 half-light 
radii (7 = 6) and they have been plotted before in Fig. [2] 
The cored halos require a slightly higher value of K, but the 
overall uncertainty on K is not larger than 20%, even if the 
exact shape and scale length of the dark matter profile is 
unknown. 

Red dashed lines in Fig. [10] display also analogous fam- 
ilies of models which are characterized by tidally truncated 
dark matter density distributions (truncated in an exponen- 
tial manner at 5 and 10 characteristic radii of the halo re- 
spectively, both for the NFW halo and for the cored halos). 
Note that, in this regard, tidal stripping does not represent 
a potentially dangerous unknown, as the uncertainty in K 
is not affected. By contrast, gray lines, and the associated 
gray shaded area represent families of models in which the 
requirement of eqn. (|30[) is much less restrictive and 7 = 2. 
In this case, the global uncertainty on K is larger than 50%. 



8 CONCLUSIONS 

There are many examples of the modelling of the dwarf 
spheroidals (dSphs) using the Jeans equations in the recent 
literature. Typically, a photometric profile (King or expo- 
nential or Plummer) is combined with a dark halo density 
law and an assumption as to the behaviour of the anisotropy 
parameter. From this, simply requiring that the model can 
hold itself up against the dark halo gravity field via the Jeans 
equations gives a correlation between the scalelengths of the 
luminous and the dark matter, and a mass estimate of the 
halo. 

Although straightforward and popular, it is not clear 
that Jeans modelling can teach us much more. A first, and 
well-known, problem is that the overwhelming majority of 
the solutions thrown up by Jeans modelling are unphysical. 
There is no phase space distribution function that can repro- 
duce the model. That is, there is no combination of stellar 
orbits that builds the stellar density and velocity field. The 
second, and ra ther more subtle p roble m, has been pointe d 
out recently bv lEvans etafl (|20Qg| ) and I An fc Evans! l|2009h . 
Seemingly innocent assumptions made concerning the light 
profile and the anisotropy law in the Jeans equations can 
lead to results that are not generic, but merely a conse- 
quence of imposing the initial conditions. This is particu- 
larly the case regarding inferences concerning the behaviour 
of the dark matter density at the very centre (for example, 
whether it is cusped or cored). 

The future should see more effort devoted to phase 
space modelling, in which a distribution of stellar orbits pro- 
vides the observables, namely the density and the velocity 
dispersion. This rich field has so far been scarcely touched. 
A pioneering effort is that of Wilkinson et al. (2002), who 
modelled the Draco dSph with distribution functions. In 
these models, the scalelengths of the luminous and dark 
matter are the same, which restricts their widespread ap- 
plicability. Wu's (2007) impressive work constructs axisym- 
metric two and three integral models for three classical dwarf 
spheroidals, Draco, Ursa Minor and Fornax. 

This paper has provided isotropic phase space models 



for all the Milky Way dSph galaxies. The observed flatness 
of the velocity dispersion profiles strongly suggests that the 
inner parts of the stellar populations are nearly isothermal, 
and so the families of lowered isothermal distribution func- 
tions made famous by Michie (1963) and King (1966) are 
natural starting points. For the dSphs, simple collisional re- 
laxation has a physical timescale which is too slow to ac- 
count for any central thermalization of velocities. However, 
a physical basis m ay be provided by the theory of tidal stir- 
ring (|Maver et alJI^OOlT ). in which dwarf irregular progeni- 
tors are transformed into dSphs by vigorous tidal shocking 
followed by bar and bending instabilities. 

The distribution functions are all isotropic in velocity 
space. First, from the point of view of good scientific prac- 
tice, the simplest assumption should be preferred until evi- 
dence to the contrary is found. In this respect, notice that 
the case of the dSphs is very different to that of ellipti- 
cal galaxies, in which there is strong evidence for velocity 
anisotropy from the plot of observe d flattening versus the ra- 
tio of ordered to random motions (llllingworthlll977l : iBinnevI 
Il978h . Second, it is striking that most of the studies allowing 
anisotropy laws (|Wu 1120071 ; lLo kas 2009) find fits suggesting 
that the dSph velocity distributions are nearly isotropic, es- 
pecially in the central regions. Third, numerical simulations 
of tidal stirring produce dSphs that are typically isotropic 
(see Figure 23 of Mayer et al. 2001). 

Clearly, though, the use of lowered isothermals is not 
the only possible way to construct models with a flatfish 
velocity dispersion profile. For example, a tangentially bias 
in the structure of the orbits that increases with increasing 
radius could provide similar kinematic profiles. However, it 
seems unlikely that a formation scenario based on a collapse 
and subsequent tidal stirring can favour a tangentially bi- 
ased velocity dispersion tensor in the outer parts. 

When lowered isothermal distribution functions are em- 
bedded in dark haloes, then the stellar distribution relaxes 
in the gravity field of the dark matter. This leads to a pre- 
diction as to the half-light radius as a function of the 
central velocity dispersion <t p ,o. Modulo the overall scalings 
of the characteristic halo scalelength and the central dark 
matter potential, we have shown that this leads in practice 
to a one-parameter family of models. If the luminous length 
scale is much less than the halo scalelength, then the re- 
lationship between R^ and er p ,o has a power-law form. In 
particular _R h oc <r Pi o for a cored halo, and _R h oc (jp for 
an NFW halo. For systems whose luminous scalelength be- 
comes comparable to the scalelength of the dark halo itself, 
significant deviations from a power-law correlation occur. 

To match the observational data for any dSph, the exis- 
tence of a theoretical curve between the dimensionless half- 
light radius and central velocity dispersion considerably re- 
stricts the set of suitable models. Note that solutions to the 
Jeans equations do not offer such a correlation - the set of 
physical solutions to the Jeans equations is overwhelmed by 
the infinitely more numerous unphysical solutions. In fact, 
in our approach, only one further ingredient is needed to fix 
the model, and this we choose as the halo scalelength (or 
equivalently the concentration of the dark halo) . Once this 
is fixed, then there is only one, for example, NFW model 
that can place the observed datum for a dSphs onto the 
theoretical curve. 

Even better, the mass within 1.7 times the half-light 
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radius is insensitive to the choice of halo scalelength. A 
larger halo scalelength enforces a compensating lower cen- 
tral dark matter density to ensure that the (i?h,o" Pl o) data- 
point lies on the theoretical curve, so that the mass interior 
to 1.7-Rh remains largely unchanged. This result - perhaps 
more surprisingly - is also true when the halo model itself is 
altered, for example from cusped to cored. Accordingly, this 
enables us to provide reasonably reliable mass estimates for 
almost all the Milky Way dSphs out to 1.7i?h- This result is 
valid for dSphs with flatfish a projected velocity dispersion 
profiles. 

The mas s resu lts do not support the conjecture of 
IStrigari et al.l ((2008) that all dSphs have a common mass 
scale within 300 pc. Hercules and Leo IV have a mass inte- 
rior to 300 pc of at most 3 x 1O 6 M0. It is probable that some 
of the more puny objects, like Segue 1 and Willman 1, also 
do not satisfy the putative common mass scale, but here the 
uncertainties of extrapolating the data on such physically 
small objects out to such a large distance as 300 pc does not 
allow us to be so definite. 

The two most massive of the Milky Way dSphs are the 
most luminous, Sgr and Fornax. Within 1.7 half-light radii, 
we estimate that Sgr has a mass of ~ 2.8 x 10 s Mq and 
Fornax ~ 1. 3 x W 8 Mp,. In particula r, we do not reproduce 
the result of lPefiarrubia et al.l l)2008h that physically smaller 
systems such as Draco and Sculptor are up to 5 times more 
massive than Fornax despite being roughly 70 times fainter. 
The least massive of the Milky Way satellites are Willman 1 
(~4x 10 5 Af Q ) and Segue 1 (~ 6 x 1O 5 M ). For 5 objects - 
namely Boo II, Leo IV, Leo V, Segue 2 and And XV - we are 
only able to provide an upper limit to the mass. It may well 
be that these objects are still more runty and unimpressive! 

We also notice that the inferred values for the mass-to- 
light ratios for some of the most luminous dSphs - Fornax 
and Leo I particularly - could in fact militate against the 
validity of the standard approximation that the luminous 
component as a simple tracer. Actually, and especially in 
the case of a cored dark halo, it is quite likely that the 
stellar component itself has a non negligible effect on the 
kinematic structure. The point that for some of the bright 
dSphs, stars may contribute equally with the dark matter in 
the central regions has been no ted before (|Lokas et al.ll2005l ; 
IStrigari. Frenk fc White!l2010l ). 

The discovery of so many new dSphs over the last few 
years has thrown open a rich field for theorists. It is clear 
that our earlier ideas of common mass scales and universal 
haloes are giving way under the wealth of new data. The 
methods introduced in this paper constitute the first steps in 
providing the dSphs with phase-space models. Here, we have 
used the data on the half-light radius and central velocity 
dispersion to provide models for the whole sample. We plan 
to complement this with detailed models for some of the 
individual brighter dSphs in the near future, using the full 
photometric and kinematic profiles. 
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APPENDIX A: ASYMPTOTIC ANALYSIS OF 
THE PARAMETER SPACE 

Here, we demonstrate some properties of the half-light ra- 
dius i?h in the two-dimensional parameter space (f t , a) by 
an analytic analysis in the asymptotic regimes. Note that 
we do not need to specify which dark matter density pro- 
files (|14[) . As long as the associated gravitational potential 
is regular at large radii, the asymptotic behaviour of the 
Rh(rt, <r) function is in fact the same. 

Let start with the case a 2 3> 1, in which case the quan- 
tity b defined in eqn (|26[) satisfies 6 < 1. The King density 
distribution Q24p then behaves like 



p*{r) 



P*,o 



81/' 



150F 



P*,o-. 



+ 0(b 7 ) 



$(r) - $(r t ) 
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(A2) 
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Equation (|A1[I implies that 

because the parameter a does not modify the profile of the 
stellar component p*(r), only its normalization. Thus, for 
any given value of the tidal radius r t , if the velocity disper- 
sion parameter is significantly higher than the central depth 
of the dark matter potential well, the half-light radius is 
constant. 

Let us consider now the regime f t ^> 1. For any value 
of a 2 , the tails of the luminous component are in the regime 
I) < 1 described by the asymptotic eq (|A1[) . The density 
profile has a divergent mass in the tails, because, as f% — > oo, 
the King distribution function reduces to the isothermal one. 
Let us indicate with 



M,, p (f t ) ~ n 



(A3) 
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the asymptotic behaviour of the projected total mass inside 
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the tidal radius. A power-law behaviour for the enclosed 
projected mass as in eqn (|A3f) is associated to a power-law 
behaviour of the gravitational potential as the radius goes 
to infinity. In particular, A = 1/2 for any regular potential 
for which $ ~ is r - >oo. We thus see that the half-light 
radius is linear in the tidal radius itself, because 

f t A ~ M», p (rt) = 2M„, p (A h ) ~ 2i? A . (A4) 

Furthermore, the ratio between the half-light radius and 
the tidal radius is fixed in the asymptotic regime (|A3[I as 
ft/Rh = 2 1//A , implying that ft/Rh ~ 4 for any regular dark 
matter potential. 

Now, let us consider the limit of small tidal radii ft <§C 1, 
in other words the luminous component is in the very centre 
of the potential well. For any value of a, the tidal radius can 
be made small enough so that the condition b <C 1 is realized 
over the entire radial profile. Repeating the argument used 
in eq (|A4[) . we see that the half-light radius is linear in the 
tidal radius and independent of the velocity dispersion pa- 
rameter. The coefficient of this proportionality depends on 
the properties of the dark matter profile only, and amounts 
to ft/Rh ~ 2.9 for an NFW profile and ft/Rh ~ 2.5 for any 
cored profile. 

Finally, the last is case a <fC 1. It is easy to see that, 
for any value of the tidal radius ft, the velocity dispersion 
parameter a can be made small enough such that the ratio 
ft/Rh is divergent. 

Fig. IA1I summarizes the properties of the models we 
recorded above by displaying the contours of the quantity 
k — ft/Rh in the (ft, a) plane. The parameter n will provide 
a useful way to characterise the theoretical a — Rh relations 
generated by these models. Note that the available parame- 
ter space is effectively divided into two regions: one for high 
values of a and/or of ft, in which k ^ 4 and one for small 
values of a in which k > 4. 



